Cluster-based Communication and Load Balancing for Simulations on Dynamically Adaptive Grids
نویسندگان
چکیده
The present paper introduces a new communication and load-balancing scheme based on a clustering of the grid which we use for the efficient parallelization of simulations on dynamically adaptive grids. With a partitioning based on space-filling curves (SFCs), this yields several advantageous properties regarding the memory requirements and load balancing. However, for such an SFC-based partitioning, additional connectivity information has to be stored and updated for dynamically changing grids. In this work, we present our approach to keep this connectivity information run-length encoded (RLE) only for the interfaces shared between partitions. Using special properties of the underlying grid traversal and used communication scheme, we update this connectivity information implicitly for dynamically changing grids and can represent the connectivity information as a sparse communication graph: graph nodes (partitions) represent bulks of connected grid cells and each graph edge (RLE connectivity information) a unique relation between adjacent partitions. This directly leads to an efficient sharedmemory parallelization with graph nodes assigned to computing cores and an efficient en bloc data exchange via graph edges. We further refer to such a partitioning approach with RLE meta information as a cluster-based domain decomposition and to each partition as a cluster. With the sparse communication graph in mind, we then extend the connectivity information represented by the graph edges with MPI ranks, yielding an en bloc communication for distributed-memory systems and a hybrid parallelization. For data migration, the stack-based intra-cluster communication allows a very low memory footprint for data migration and the RLE leads to efficient updates of connectivity information. Our benchmark is based on a shallow water simulation on a dynamically adaptive grid. We conducted performance studies for MPI-only and hybrid parallelizations, yielding an efficiency of over 90% on 256 cores. Furthermore, we demonstrate the applicability of cluster-based optimizations on distributed-memory systems.
منابع مشابه
Cluster Optimization and Parallelization of Simulations with Dynamically Adaptive Grids
The present paper studies solvers for partial differential equations that work on dynamically adaptive grids stemming from spacetrees. Due to the underlying tree formalism, such grids efficiently can be decomposed into connected grid regions (clusters) on-the-fly. A graph on those clusters classified according to their grid invariancy, workload, multi-core affinity, and further meta data repres...
متن کاملCluster-Based Parallelization of Simulations on Dynamically Adaptive Grids and Dynamic Resource Management
The efficient execution of numerical simulations with dynamically adaptive mesh refinement (DAMR) belongs to the major challenges in high performance computing (HPC). With simulations demanding for steadily changing grid structures, this imposes efficiency requirements on handling grid structure and managing connectivity data. Large-scale HPC systems furthermore lead to additional requirements ...
متن کاملPartitioning and Dynamic Load Balancing of Adaptive Hybrid Grids for Large-Scale Turbulent Flow Simulations
A new partitioning method for complex 3-D hybrid prismatic/tetrahedral meshes is presented, The method uses an orthogonal recursive bisection approach on a special octree corresponding to the hybrid grid. The octree is generated automatically and handles any type of 3-D geometry and domain connectivity. It is employed for partitioning of static, as well as dynamic adaptive meshes. The method yi...
متن کاملAdaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments
Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...
متن کاملAdaptive dynamic load-balancing with irregular domain decomposition for particle simulations
We present a flexible and fully adaptive dynamic load-balancing scheme, which is designed for particle simulations of three-dimensional systems with short ranged interactions. The method is based on domain decomposition with nonorthogonal non-convex domains, which are constructed based on a local repartitioning of computational work between neighbouring processors. Domains are dynamically adjus...
متن کامل